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Abstract 

As part of a program to discover improved glycoside hydrolase family 1 2 (GH 1 2) endoglucanases, we have 
studied the biochemical diversity of several GH 12 homology. The H. schweinitzii Cel12A enzyme differs 
from the T. reesei Cell2A enzyme by only 14 amino acids (93% sequence identity), but is much Jess 
thermally stable. The bacterial Cell2A enzyme from S. sp. J JAGS shares only 28% sequence identity to the 
T. reesei enzyme, and is much more thermally stable. Each of the 14 sequence differences from H. 
schwelnitzii Cell2A were introduced in T> reesei Cell2A to determine the effect of these amino acid 
substitutions on enzyme stability. Several of the T. reesei Cell2A variants were found to have increased 
stability, and the differences in apparent midpoint of thermal denaturarion (T m ) ranged from a 2.5°C increase 
to a 4.0°C decrease. The least stable recruitment from H. schweinitzii Cell2A was A35S. Consequently, the 
A35V substitution was recruited from the more stable 51 sp. 1IAG8 Cell2A and this T. reesei Cell2A 
variant was found to have a T m 7,7*0 higher than wild type. Thus, the buried residue at position 35 was 
shown to be of critical importance for thermal stability in this structural family. There was a ninefold range 
in the specific activities of the Cell2 homologs on o-NPC. The most and least stable T. reesei Ce)12A 
variaots, A35V and A35S, respectively, were fully active. Because of their thermal tolerance, S. sp. II AOS 
Cell2A and T. reesei Cell2A variant A35V showed a continual increase in activity over the temperature 
range of 25 °C to 60°C, whereas the less stable enzymes T. reesei Cell2A wild type and the destabilized 
A35S variant, and hi. schweinitzii Cell2A showed a decrease in activity at the highest temperatures. The 
crystal structures of the H, schweinitzii, 51 sp. UAG8, and T, reesei A35V Cell2A enzymes have been 
determined and compared with the wild-type 71 reesei Cell2A enzyme. All of the structures have similar 
Ca traces, but provide detailed insight into the nature of the stability differences. These results are an 
example of the power of homolog recruitment as a method for identifying residues important for stability. 
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Bacterial and fungal cellulases have been intensively stud : 
ied for many years. They are used widely in the detergent, 
textile, and food industries. Cellulose and heraicellulose ac- 
count for more than half of the organic carbon in the bio- 
sphere, and there is continuing interest in the potential use 
of cellulases in the conversion of cellulosic biomasa to fer- 
mentable sugars. Cellulases are glycoside hydrolases found 
in at least -12 families of this very large group of enzymes 
(Henrissat and Davies 2000), and consist of cellobiohydio- 
lascs (or cxoglucanases, EC 3.2.1.91) and endoglucanases 
(EC 3.2.1.4). Glycoside hydrolase family 12 (GH 12) mem- 
bers hydrolyse the p-l,4-glycosidic bond in cellulose via a 
double displacement reaction and a glycosyl-enzyme inter- 
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mediate that results in retention of the anomeric configura- 
tion in the product (Schulein 1997; Birean et al. 1998). 
Structural studies on GH 12 members reveal a compact 
p- sandwich structure that is curved to create an extensive 
cellulose-binding site on the concave face of the p-sheet 
(Sulzenbacher et al. 1999; Fig. 1AJB)- Structures from both 
bacterial (Sulzcnbacher et al. 1997; Crenncll et al. 2002) 
and fungal (Sandgren et al. 2001; Khaderai et al. 2002) 
sources have now been determined, and these provide the 
structural framework for the whole family. As part of a 
program to discover cellulases with improved properties, 
many novel GH 12 endoglucanases have been cloned, se- 
quenced, and expressed (van SoHngen et al. 2001; Goede- 
gebuur et al. 2002). The Cell2A enzyme from the alkalo- 
phiJic bacterium £ sp. 1JAGS was found to be more stable 
to thermal inactivation than the homologous enzyme from 
the filamentous fungus 7". reesei, and was identified as a 
good candidate tor industrial applications (van Solingen et 
al. 2001). The present work is pan of a study of the bio- 
chemical and structural diversity of this sequence-diverse 
family to understand the sequence-structure-function rela- 
tionships within the GH 12 proteins. 

In this study, we have measured the stability and activity 
of several GH 12 homologs and point mutants of T. reesei 




Improving the thermal stability of GH 12 enzymes 

CC112A, in an attempt to identify residues important for 
thermal stability. Significant biochemical diversity was seen 
among the GH 12 homologs (Table la). Notably, the en- 
zyme from the fungus H. schweinitzii (Goedegebuur et al. 
2002) differs from the T. reesei Cel 12A enzyme at only 14 
residues, but is significantly Jess thermally stable. We have 
systematically introduced these 14 differences into the T. 
reesei CeJJ2A enzyme by site-directed mutagenesis, and 
examined their effects on the thermal stability of the en- 
zyme. Several mutations recruited from H> schweinitzii af- 
fected stability, and A35S (aJ! numbering is based on the T. 
reesei Cell 2 A sequence unless otherwise noted) was found 
to be the most destabilizing. Consequently, the A 35V mu- 
tation was recruited from the more stable 5. sp. HAGS 
Cell2A enzyme and was found to be stabilizing. Thus, this 
process has identified buried residue 35 as being of critical 
importance. The large differences in stability between the 
enzymes prompted us to determine the crystal structure of 
two related GH 12 enzymes* the fungal Cell2A from H, 
schweinitzii and cd of the bacterial Ccll2A from the S. sp. 
11AG8, and of the most stable T. reesei Cell2A mutant, 
A35V. Comparison of these three structures was used to 
provide detailed rationale for the observed stability differ- 
ences. 




Figure 1. Schematic ribbon diagram. Top, (A) and aide (fl) views of the II jchwcJnitzti Cell2A crystal structure, color ramped 
according to residue number, Starting with red at the amino terminus and ending with blue at the carbo*yl terminus of the structure. 
The two 0-shccfc the structure arc labeled A and B, with Che individual strands labeled (Al-AG* and B1-B9) according to iheir 
positions in the two fc-shcets. The structures have side chains drawn (or the 14 residues that differ from the 7. rccsci Cel J2A protein 
sequence. Figures 1 and 3 were prepared usirtfe O (Jones ct al. 1991), and rendered with Molfay (Harris and Jonea 2001). 
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Table la. Thermal denaiuraticn data and relative specific 
enzyme activity 



GH12 boraulog 






fit 
error* 


Activity* 1 


T. rcesei CfcU2A 


0-0 


54.4 


0.19 


1.00 


H. xchweimtzii CtMlA 


-5.2 


49.2 


0.08 • 


1.08 


S. sp. UAG* CelI2A cd 


11.3 


65.7 


0-06 


3.84 


5. sp. HAGS CeU2A fi 


12.4 


66A 


0.03 


3.78 


T. koningii Ccll2A 


4.1 


58-5 


0.76 


1.00 


roi-cwm CellX 


-S.3 


45.9 


0.17 


0.42 


F. javamrwn Cell2A 


-0.3 


54.1 


0.07 


ND 



n AT„ value* ant relative to T. rcesei CelBA wild type. 

* The therwa) denaniration experiments were performed at 217 nm, in 0.05 
M Bis-Tris propane, 0.05 M ammonium aeerafc (pH 8.0) by mcreajdng mc 
tempcrauit* from 30 to 90*C with data coUecced every 2 decrees. The 
midpoint of the transition (? J k> an apparent value because tne thermal 
denacu ration is not reversible. 

* T n Pit eiror corresponds to one standard dcviiilion. 

4 The specific enzyme activity was measured irt 10 min incubation with 
0NPC at 4Q°C (pH 5.5). It is expressed as a mole specific activity relative 
to T. rcesei Ccil2A wild type. The Standard deviations are 10% or less. 



Results 

Stability determination 

The CD signal at 217 nm as a function of temperature was 
collected for a]l of the homologs and mutants and fitted to 
two-state models (Fig. 2A). The parameters from those fits 
were used to calculate a curve representing the apparent 



traction of unfolded protein, F^p, and the fitted curve for 
each sample (Fig. 2B). The apparent T m for each sample is 
listed in Table la. As has been reported previously (Ward ct 
al. 1993), the recombinant T. reesei Cell2A proteins ex- 
pressed in Aspergillus niger were hyperglycosylated, as was 
H. schweinitzii Cell2A (data not shown). All stability de- 
terminations reported were on proteins deglycosylated with 
endoglucanase H. For the wiid-type Z reesei CeI12A, this 
resulted in a form with the same T M as the native protein 
expressed in J. reesei (data not shown). This is consistent 
with the direct observation of an identical single NAG resi- 
due on Asn 164 in the native 7*. reesei Cell 2 A (Sandgren ct 
al. 2001) and in the deglycosylated recombinant A35V van- 
ant and H* schweinitzii CeI12A. 

Compared with the T, reesei Cell2A enzyme, which has 
a T m of 54.4°C the H. sckweinitzii enzyme has a T m that is 
5,2°C lower, whereas the S. sp. 11 AGS Cell2A homolog 
has a T m that is 12.6°C higher. The stability ranking by T ra 
(Table la) was the same as that seen by thermal inactivation 
rates at pH8 (data not shown). The variants recruited from 
the tt schweinitzii Cell2A enzyme have T m changes rang- 
ing from -4.0°C to +2^ C C for the most stable mutant. The 
A T m values given in Table lb have a standard deviation of 
0.2°C or less, and T m differences of <0.5°C were not con- 
sidered significant. The most significant changes in stability 
occur within the first 63 airdno acids. The A35S mutant has 
the largest decrease in Stability (A T m =■ -4.0°C), whereas 
mutating the same residue to a valine produces the largest 
increase in stability (A T m = +7.7°C). 
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Figure 2. The raw thermal dan miration data (A) is shown for a subset Of proteins examined. Thcae data and similar traces for all of 
the proteins listed ifl Tables (a and lb were fit to two-state models and used to produce a ftacti on-apparent unfolded, plot (0). 
By this convention. 0 represents native and ! represents unfolded state for the protein. Only a subset of the data sets are labeled. The 
protein concentration fot th* experiments was between 10 and 20 jlM. 
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Table lb. Thermal denaturation data for T. reesei 
CelllA variants 



T. reesei 



Ccll2A variant 


AT m * 




Fit error* 


A35V 


7.7 


62.1 


0.07 


W7Y 


-1.0 


53.4 


0.24 


Tusmei 


I.I 


55.5 


0.13 


A35S 


-4.0 


50.4 


0.14 


S30N 


OJ 


54.9 


0.17 


G41A 


2.5 


56.9 


0.11 


S63V 


-0.8 


53.6 


0.1 1 


A66N 


0.1 


545 


0.10 


S77G 


0.1 


54.5 


0,09 


N9ID 


0.5 


54.9 


0.17 


S143T 


0.5 


54.9 


0.12 


T163S 


0.3 


54,7 


ao7 


N167S 


0.2 


54.6 


aio 


A]$$G 


0.3 


54.9 


0.17 



* AT M vaJues are relative to T. reesei Ccl12A wild type 

b The thermal denaturation experiments were performed at 217 nm. in 0.05 
M BIs-Trjs propane. 0.05 M ammonium acetate (pH 8.0), by increasing the 
temperature from 30 to 90 D C with data collected every 2 degrees. The 
midpoint of the transition (T„) is an apparent value became the thermal 
denaiOradon h nOI iCVettible. 

* T IB Fit error corresponds to oo* standard deviation. 



Relative enzyme activity 

The expression of all of the Cell 2 homologs and variants in 
A, nlger was detected by activity on derivatized cellulose 
(Goedegebuur et a). 2002), and o-Nitrophenyl p-cellobio- 
side (oNPC) assays were used to monitor the purifications. 
Thus, all of these proteins are active enzymes. There was a 
ninefold difference in the specific activities of the Cell2 
homologs on oNPC (Table la). The T. reesei Cell2A vari- 
ants with the greatest and least stability (A35V and A35S, 
respectively) both .showed -30% higher activity than wild 
type (Fig. 3). 

The activity of selected Cell 2 enzymes as a function of 
temperature (Fig. 3) was determined, and corresponded to 
apparent activation energies between 10 and 1 1.5 kcal/mole 
in all cases. The Cell2 enzymes with greatest thermal sta- 
bility, Cell2A S. sp. 11AG8 and T. reesei Cell2A variant 
A35V, showed a continual increase in activity over the full 
temperature range from 25°C to 60°C (Fig. 3). The less 
stable homologs, T. reesei Cell2A and H. sclvwe'mitzii 
CC112A, and the destabilized T. reesei Ccll2A variant 
A35S, showed a decrease in activity at the highest tempera- 
ture, presumably due to thermal inactivarion. 



Structure determination 

To better understand the structural basis for thermal stability 
in tbis family of enzymes, the structures of H. sehweinittii 
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100 




Temparatura (*C) 

Fignre 3. The tcrapcratnrc dependence of enzyme activity was assayed by 
hydrolysis of the chromogenic substrate oNPC tn 0.05 M Bis-Tris propane, 
0.03 M ammonium acetate (pH 5.5) over 10 mm at 25"C, &Q*C, 50 9 C, and 
60 a C. Data arc shown for 5. sp. JJAG8 Ccll2A cd (■), T. reesei Ccll2A 
variant A35V (□>, T. reesei Ccll2A (4). T. reesei Cdl2A variant A35S 
(O), and FT. schweinitdi Ccll2A (•). (t is expressed as a molar specific 
aciiviry relative to T. reesei 

Cell2A, the T. reesei Cell2A A35V mutant, and S. sp. 
DAGS Cell2A cd have been determined and refined to 
reasonably high resolution (Tables 2a, 2b). The protein 
structures have the expected fold of a GH 12 enzyme. They 
consist mainly of 15 ^-strands building up 2 3-sheets, A and 



Tabic 2a. X-ray data collection and processing statistics 



GH 12 


T, reesei 


S> sp. HAGS 


H. sckweinitzii 


da laser 


A35V 


wild type 


wild type 


Collected 


ESRP 


Rotating 


ESRF 




JJDMEHt 


Anode 


ID14EH4 


Detector 


MAR 


Raxisn 


ADSC 


Wavelength A 


CCD 265 




Quanta 4r 


0.93 




0.98 


Oscillation ransc O 


0.5 


03 


0.5 


Space group 


P2, 


P2.2.2, 


P2, 


CcO parameters (A) 








a « 


68.3 


65.2 


624 


b = 


71.3 


54.6 


774 


c - 


119.3 


62.6 


83,4 


p = 


91.5° 




984° 


Resolution range (A) 


25-1.5 


29-1-5 


30-1.7 


Resolution range 








outer sheJJ 


1.53-1.50 


1.53-1.50 


1.73-1-70 


No. of observed 








reflections 


490.560 


88.448 


346,287 


No. of nniquc 








reflections 


172,895 


26,559 


85,680 


Average multiplicity 


2.8 


33 


4.0 


Completeness (%) v 


94.4 (94.3) 


72.9(63-5) 


98.9 (S4.6) 




3.7 (38.1) 


5.4 (16.1) 


9.7 y »2.u) 




24.9 (2.3) 


15.4 (8.6) 


14.0 (5.7) 



"European Synchrotron Radiation Facility (ESRf). Grenoble. France 
b Numbers in parentheses are for their highest resolution bins. 
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B ? A consisting of 6 p-strands (A1-A6) and B of 9 
3-strands (B1-B9) stacked on top of one another, forming a 
3-sandwich (Fis- 1 A3)- The concave surface of the larger 
3-sheet B produces a large substrate-binding cleft across 
one face of the enzyme (Sulzenbaeher et a). 1997). From 
structural comparisons with GH 12 homoiogs with known 
structure in which the catalytic residues have been identified 
(Zechel et aL 1998; Sulzenbaeher et aL 1999), the identities 
of the catalytic nucleopbUe and the acid/base have been 
deduced. These correspond to glutamyl residues 116 and 
200, respectively, in 7. reesei Cell2A, an identification sup- 
ported by sice-directed mutagenesis (Okada et al. 2000). 



H. schweinitzii Cell2A structure 

The fungal Cell2A enzyme from H. schweinitzii crystal- 
lizes with four NCS-relaled molecules in the asymmetric 
unit. It differs at only 14 residues in the protein sequence 
(Fig. I ; Table lb) from the fungal 7*. reesei Cell2A enzyme, 
and there are no insertions or deletions. The complete set of 
Ca atoms from the four NCS molecules in the //. schwein- 
itzii Cell2A structure and the six NCS molecules in 71 
reesei Cell2A structure can be superimposed with pair-wise 
RMSD in the range from 03 to 0.5 A, Some of the 
biggest main-chain differences between the two structures 




can be found in the two loops corresponding to residues 
1 1— 16 and 37-43, connecting g-strands Bl to B2 and A2 to 
A3 in the structure (Fig. 1A3)> 

The H. scfiweinitzii Cell2A enzyme has a 5.2°C lower 
T m than the T. reesei enzyme (Table la). Most of the 14 
differences m sequence are located on the protein surface 
and distributed over the whole molecule (Fig. 1A3)- The 
side chains at many of these sites point out into the sur- 
rounding solution. All of these substitutions (except N91D) 
are from a neutral to another neutral amino acid, and most 
have little or no effect on the T m of the enzyme (with A T m 's 
in the range from 0.1°C to 0.5°C) (Table lb). There i6 a 
clustering of substitutions on the first (3-strands in the struc- 
ture, A2-A3 and B1-B3 (Fig. 1A,B), in which some of the 
individual substitutions have a large effect on the T m of the 
7. reesei Cell2A enzyme. This region also has some of the 
largest structural differences between the structures of the 
two fungal enzymes. The substitution in this region that has 
the biggest effect on the T m is the alanine to serine substi- 
tution at residue 35 on p-strand A2 (Fig. 4A). This substi- 
tution causes a reduction of the T in by 4.0°C when intro- 
duced into 7. reesei Cell2A Ofabte lb). The Hkely expla- 
nation for this is the introduction of a hydrophiJic residue in 
the hydrophobic environment at the edge of the two 
P-sheets, disrupting the hydrophobic interactions with the 
side chains of the three surrounding hydrophobic residues 
(F10, V17, and L38, with interatomic separations in the 




Figure 4. (A) Interactions and conformational changes close to residue 35 Of the f0n#d GH 12 en2Vjrne« fron* T. rccuti (wild type and 
A.35V have carbon atoms colored yellow and goJdenrod)* and H. schweinitzii (carbons colored gold). Red bubbles indicate contacts 
in T. totfci A35V CeJ.UA, blue bubbles in H. schweinitzii CcI12A. (0) Interactions and conformational changes close 10 residue 34 
of the bacterial GH 12 enzyme* from 5. sp. I) 'ACS (carbon* colored gold), and 5. Iividara (carbons colored yellow). Red bubbles 
indicroe contacts in S> lividtms &1B2, blue bubbles tn £ sp. NAGS Cell2A. 
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n*nge from 3.4 to 3.9 A) (Fig. 4A). Tbere are two other 
substitutions in this region of the structure, W7 Y in the loop 
between (^-strands Al/Bl and S63V on p-strand B3 that 
cause a reduction in the T m by 1 D C and 0.8 °C t respectively, 
when introduced into 7*. reesei CeI12A. The T m decrease 
caused by the S63V substitution is probably due to Jocal 
rearrangement to alleviate a close contact. Modeling the 
observed valine side-chain conformation of the M. schwein- 
itzii enzyme, on the T. reesei enzyme gives a 3.0 A clash 
between the CH of V63 and the Cp of A42_ In the H. 
sckweinitzii structure, we observe a movement resulting in 
a distance between the C72 of V63 and the Cp of A42 of 
3,9 A. 

Interestingly, there are three substitutions in this region of 
the less stable H. schweinitzii Cell2A enzyme G41A, and 
Tl 1S/T16I that cause a T m increase (2.5°C and 1.1°C, re- 
spectively) when introduced in T. reesei Cell2A (Table lb). 
The T, u increase of 2,5°C caused by the G41A substitution 
is JikeJy due to a new hydrophobic interaction between the 
Cp of A41 and the C72 of residue T214 on 0-strand A4. 
There might also be a contribution from the reduction of the 
chain entropy in the unfolded form of the protein. The in- 
creased T m caused by the TllSAf 161 substitutions is harder 
to explain, as both of these residues sit in the bottom of the 
substrate-binding cleft on p-strands Bl and B2 with their 
functionfd groups pointing our into the solution. Moreover, 
they do not have any obvious interactions with other resi- 
dues that could stabilize the structure. 

T. reesei CeU2AA35V structure 

The T, reesei Cell 2 A A35V mutant enzyme crystallizes 
with six NCS-reJated protein molecules in the asymmetric 
unit, making up three pairs of interacting molecules. This 
crystal packing is similar to the wild-type, structure 
(Sandgren et al. 2001). The pairs of molecules interact with 
an approximate twofold symmetry. Three aromatic side 
chains from each monomer that are important for Substrate 
binding (W7, W22, and Ylll) are buried in this quasi- 
dimer interface. The indole rings of W7 and W22 ace close 
to the quasi-twofold axis, but do not stack. The side chain of 
Y1H packs against the edge of the P-sandwich, between the 
reverse turns at residues 6 and 25. The three pairs arc not 
identical, but show differences that are presumably a result 
of the crystal packing. 

Because of the relatively high resolution of the study, we 
get insights into conformational variation and potential 
crystal contact artifacts. The A35V mutant crystallizes in 
the same space groups as the wild-type enzyme, but with 
some changes in the unit cell constants that arise from 
slightly different crystal packing. The biggest differences 
between the six different NCS molecules correspond to loop 
regions that take on different conformations in the different 
NCS molecules, and these axe affected by the crystal con- 



tacts. Of the loops helping to form the substrate-binding 
site, those at residues 153, 111,55, and 26 arc most affected, 
and show clustering into two or more preferred conforma- 
tions. On the convex side of the p-sheet, the biggest varia- 
tion occurs at residues 141 and 40. Both are dominated by 
one outlier and, in the case of the loop at residue 40, the 
density of the outlier is of rather poor quality. The wild-type 
enzyme also shows loop-flexing motions, but with a smaller 
variation as a result of the crystal packing. 

The introduction of a valine at position 35 influences 
packing. Residue A35 is located on p-strand A2 on the 
smaller p-sheet A close to the amino terminus (Fig. 1 AJB), 
The functional group of the residue points into the core 
between the interacting p-sheets, where it interacts with a 
number of spatially adjacent side chains. Residue 35 does 
not cause a major conformational change or local rearrange- 
ment in the structure, however. The side chain takes on a 
preferred rotamer conformation (x = +180°) and the 
methyl groups make good van dcr Waals contacts with the 
neighboring hydrophobic residues' side chains (F10, V17, 
L38, with contacts in the range from 3-6 to 3-9 A) (Fig, 
4A). In the wild-type enzyme, the packing of the alanine 35 
Cp methyl group is not so tight (Fig. 4A). The approxi- 
mately equivalent pairs of Cots, from the T. reesei Cell2A 
wild-type and A35V structures can be superimposed with 
pair-wise RMSD's in the range from 0.4 to 0.6 A. 

S. sp. 11 AGS Cell2A structure 

The cd of the bacterial S. sp. 1JAG8 Cell2A enzyme crys- 
tallizes with only one molecule in the asymmetric unit. The 
5. sp. HAGS Cell2A cd structure is most similar to the 
Streptomyces tividans GH 12 enzyme CelB2, with which it 
shares 72% sequence identity (Fig. 5), and there are no 
insertions or deletions in the sequences. In the CelI2A cd, 
there is no observable density for the residues in the linker 
region beyond Ala 222. The complete set of Ca atoms from 
the two structures can be superimposed with a RMSD of 
0.49 A. 

The S. sp. 11 AGS Cell2A cd structure has good stereo- 
chemistry with only five nonglycine residues outside of a 
stringent boundary Ramachandran definition (Kley wegt and 
Jones 1996a). There are two disulfide bridges in the struc- 
ture. The one between Cys 5 and Cys 31, linking p-strands 
Al and A2, is conserved throughout the whole GH 12 
(Sandgren et al. 2001). The second disulfide bridge is 
formed between Cys 64 and Cys 69, and is found in an 
extended loop between P-strands B3 and A5 that is part of 
the substrate-binding cleft of the enzyme. Structural align- 
ment shows that the bacterial enzymes have an insertion in 
this loop compared with the fungal GH 12 enzymes 
(Sandgren et al, 2001). This suggests that the purpose of this 
disulfide may be to stabilize the insertion. 
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Two Ce)B2 structures have been described; in one, the 
methionine M122 (S. sp. I JAGS Cell2A numbering) at the 
active site has been oxidized (SuJzenbacher et al. 1997), 
whereas die other structure is in complex with a fluoroccl- 
lotrioside that has been designed to trap a covalent interme- 
diate to the active site nucleophile (Sulzenbacher et al. 
1999), The oligosaccharide has been modeled in two con- 
formations, one as the covalent intermediate to the nucleo- 
phile El 20, and the other as free substrate. The nucleophile 
has also been modeled in two conformations m the complex 
structure. In our bacterial structure, the active site nucleo- 
phile is most like the one in the major conformation de- 
scribed for the CclB2 structure with a free fluorocellotrio- 
sidc bound in the binding clefL The hydrogen-bond inter- 
actions arc essentially identical, including the close contact 
of the acidic groups of the nucleophile and D 104 (2.6 A 
separation), and the conformations of the ring of identical 
supporting side chains (Sandgren et al. 2001). The residues 
lining the substrace-binding groove are highly conserved 
and very similar in structure. Some of the largest differences 
involve side-chain shifts of ~0.5 A for the aromatic rings of 
W24 in the -2 site, and F39 and Y66" in the -3 site. These 
differences may be due to substrate binding in CelB2. The 
czj-peptide at P75 is also conserved between the two struc- 
tures. 

One of the largest main-chain differences is at residue 
A/V34, the equivalent site of the A35V mutation that we 
show affects temperature stability in the 71 reesei Cell2A 
enzyme: The loop corresponding to residues 33-38 has a 
small rigid body shift, so that in the CelB2 enzyme it moves 
closer to the upper sheet (Fig. 4B). As with the 71 reesei 
Cel 1 2A enzyme, residue 34 is located on (3-strand A2 at one 
edge on the {J-sandwich as shown in Figure 1, A and B. In 
CeIB2, the C(3 of A34 has hydrophobic contacts with the 
three equivalent inward-pointing residues from the upper 
sheet (Tl 1,113, and V19, with interatomic separations in the 
range from 3.6 to 4.2 A). In the S. sp. 11 AGS Cell2A 
structure, the methyl groups of V34 also remain in good van 
der Waals contact with the equivalent side chains (separa- 
tions 3.5-4-3 A), (Fig. 4B). The effect of sequence differ- 
ences at this residue has only a small effect on the accessi- 
bility of solvent to the inner, more closely packed p-sand- 
wich. 

Discussion 

Glycoside hydrolase family 12 members show a wide varia- 
tion in their thermal stability. The T m variation from the 
least to the most stable enzyme examined is 20.9 D C (Table 
la). These types of stability differences between homolo- 
gous proteins are common, and there has long been an in- 
terest in identifying the sources of the increased stability in 
the extremophiles (Jaenicke 2000). There have been cases in 
which, for instance, changes in loop length or compactness, 



Improving the thermal stability of GH 12 enzymes 

disulfides, electrostatics, and hydrophobic interactions have 
been shown to contribute to the stability differences among 
homologs (Jaenicke and Bohm 1998; Vieille et al. 2001). 
However, no genericaJJy reliable rules have emerged to pre- 
dict stabilizing changes from comparisons of sequences 
aJone or, even modeled structures. Although we find a high 
level of structural homology between the Cell2A proteins 
from the mesophile (71 reesei) and the more stable enzyme 
from the alkalophilic bacterium (S. sp. 1 1 AGS), it is by no 
means obvious, given the low sequence identity of 28%, 
which changes are needed to stabilize the T reesei protein. 
However, we were able to use the cbaracterization of a Jess 
stable homoiog, together with examination of the T. reesei 
Cell2A structure and sequence analysis, to guide our choice 
of recruitment from the more stable homoiog. 

Ceil2A from the fungus H. sckweinitzii has a 
(49.2°C) that is 5.2°C lower than the T^ (54.4°C) of the T, 
reesei Cell2A. As the H. schweinUzii homoiog differed 
from the T. reesei enzyme at only 14 amino acids, it was 
possible to make all of these changes in individual 71 reesei 
Cel12A variants. Mutating the A35 in the 71 reesei enzyme 
to a serine, the equivalent amino acid in H. schweinitzii 
Cell2A, results in a drop of the T m by 4.0*C (Table lb). An 
examination of a sequence alignment of the homologous, 
fungal, and bacterial Cell2A enzymes (Goedegebuur et al. 
2002) shows a statistical preference for alanine or valine at 
that position. The Ccl 12A from £ sp. 11 AGS has a valine at 
that position, and has a T m (66.8°C) that is l2.4 a C higher 
than thai of the 71 reesei Cel 12A enzyme. This analysis, and 
examination of the 71 reesei Cell2A structure, led us to 
recruit the A35V substitution from S. sp. 11 AGS Cell2A. 
Mutating this single residue to a valine results in a 7.7°C 
increase in T^, to a value that is only 4.7°C leas than that for 
the S. sp. 11AG8 CellZA enzyme. The effects of the A35S 
and A35V mutations account for most of the T m oUfiference 
between wild-type 71 reesei CcI12A and the GH 12 homo- 
log from which they were recruited. Thus, a clear finding 
from this effort was that the buried amino acid at position 35 
plays a key role in modulating the stability of these en- 
zymes. 

The GH 12 homologs showed a ninefold range in their 
specific activities on oNPC In contrast to their effects on 
stability, the recruitment of the A35S or A3 5 V into 71 reesei 
CelI2A has little effect on specific activity. The stability of 
the GH 12 proteins to thermal denaturation was reflected in 
the effects of temperature on their activity. As expected, the 
homoiog with greatest thermal stability, Ccll2A S. sp. 
11 AGS* showed a continual increase in activity over the 
temperature range from 25°C V to 60°C (Fig. 3). The Jess 
stable K schweiniizii Cell2A enzyme showed a decrease in 
activity at the highest temperature, presumably due to ther- 
mal inactivation. The recruitment of the A35V and A35S 
into T. reesei Cell2A mimicked the behavior of the parent 
enzymes, with A35V showing a lugh-temperature activity 
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benefit compared with wild type, or to the least stable vari- 
ant A35S, 

Examination of the Cell2A structures (Fig. 4 A) provides 
rationalization for the effects Oft stability. An alanine, ser- 
ine, or valine at this position has little effect on the back- 
bone structure or on the solvent accessibility of interacting 
residues. However, when this residue is a serine, it intro- 
duces a hydrophilic residue into the hydrophobic core be- 
tween two (B-sheets. When it is a valine, it results in a tighter 
set of van der Waals interactions with three neighboring 
side chains in particular, and closing entry to the P-sheet 
sandwich core. The stabilizing effect of improved interior 
paclcing has been shown in some other proteins (Matswnura 
et al. 1988; Golovanov et al. 2000: Ohmura et al. 2001). 

After the completion of this study, die structure of a 
highly thermostable Cel 1 2 cndoglucanase from the thermo- 
philic bacterium Rhodothermus marinus was solved (Cren- 
nell et al, 2002). On the basis of a purely structural com- 
parison with the two other Cel 12 structures then available 
for T. reesei Cell2A (Sandgren et al. 2001) and 5. lividans 
CelB2 (Sulzenbacher et al. 1997), the authors suggested that 
the features that may be largely responsible for the extreme 
stability of the R. marinus Cell2A are an increased number 
of surface ion pairs, and reduced mobility of a specific loop. 
In contrast to the CelB2 structure, this 153-158 loop in our 
bacterial Cel 12 structure, 5. sp. HAGS Cell2A, has con- 
tinuous density and is not particularly mobile. The loop's 
main-chain temperature factors (ca. 21 A 2 ) are similar to the 
reported values for the 157-161 loop in R. marinus Cell2A 
(Crennell et al. 2002), less than 1.5 times the average (15.6 
A*) and not greater than those for any other flexible loop in 
the protein. Similarly, the equivalent 149-154 loop in T. 
reesei CelI2A has temperature factors comparable with 
those for the loop in the S. sp. J1ACS Cell2A structure. 
None of the mutations introduced in this study would be 
expected to affect either the number of ion pairs or this 
specific loop. The R. marinus CelJ2A protein has only 28% 
identity with T. reesei Cell2A, and fits best in the same 
Cel 1 2 subfamily (12-3), as all of the Streptomyces members 
(Gocdcgcbuur et al, 2002). It is interesting to note that the 
amino acid at the position equivalent to A35 in T. reesei 
Cell2A is an alanine for five 12-3 members. However, it is 
a valine in the remaining two, R. marinus Cell2A and 5. sp, 
U ACS Cell2A, both of which show increased thermal sta- 
bility. 

The clear identification of a single residue responsible tor 
large differences in stability within a structural family may 
be unusual. It is obvious from other studies (Lehroann and 
vvyss 2001; Lebmann et a!. 2002) and from the limited 
sampling presented in this Study that not all recruitments 
from more stable homoJogs are stabilizing. For example, the 
W7Y, 5(53 V, S77G, T163S, andN167S variants of T. reesei 
Celi2A all represent recruitments of the side chains also 
found in S, sp, J J AOS Cell 2 a, but none were stabilizing 



(Table lb). Conversely, not ail recruitments from the less- 
stable homolog were destabilizing. In contrast to A35S, 6 of 
the 14 substitutions recruited from the H. schweinitzii 
CC112A Stabilized T. reesei CeJ12A- Recruiting mutants 
with increased stability from less-stable homologs has been 
reported previously in other families (Shaw et al. 1999; 
Lehmann et al, 2000; Perl et al. 2000). There is another 
indication of the complexity of protein stability even in this 
relatively simple comparison of two very close (structural 
and sequence) homologs; summing the A T m *s observed for 
each of the recruitments from H> schweinitzii Cell2A into 
the T. reesei protein gives a net value close to zero, but the 
H. schweinitzii tnzytnt has a T m that is 52°C lower than the 
71 reesei enzyme (Table la). la the well-studied case of T4 
lysozyme, the effects of point mutations on stability were 
shown to be additive (Zhang et al. 1995). The lack of simple 
additivity seen in this study may be due partly to the spatial 
clustering of eight of the changes on the first six (B-strands, 
leading to some interdependence of their effects. There may 
also be some added complexity arising from the fact that the 
thermal den aru ration is irreversible, so that stability is being 
measured as apparent T m values rather than as the preferred 
thermodynamic free energy values. T. reesei Cell2A is 
known to denature reversibly in urea (Arunachalam and 
Kcllis, Jr., 1996) but, despite extensive efforts, we could not 
find conditions under which we could compare the revers- 
ible denaturation of this wild-type protein with that of vari- 
ants of different stability or of other Cell 2 homologs. 

In this work, structural and stability studies of closely 
related proteins provide insight on how specific residues 
contribute to protein stability in glycoside hydrolase family 
12. Several stabilizing mutations have been identified and 
one expectation for future work is that these could be com- 
bined to give greater increases in stability. We are also 
continuing these studies to include other members of this 
family. 

Materials and methods 

Protein expression and purification 

The DNA encoding for T. rewei Cel!2A was amplified from a 
cDNA clone (Ward et al. 1993) by use of PCR primers that intro- 
duced a B$m restriction endonuclease site at the 5' end of the 
Cell2A gene (immediately upstream of the first ATG codon) and 
an Xbal site at the 3' end (immediately downstream of the stop 
codon). The amplified fragment was then subcloned as a Bglll- 
Xbal fragment into pUC19 vector. All mutants were made in this 
plasmid by use of QuikChangc mutagenesis methods (Stratagcnc) 
as single-point mutations, with the exception of the double-mutant 
T11S/T161, as examination of a prdinurouy structure suggested 
that these side chains could interact. The variant or wild- type gene 
was then subcloned into the Aspergillus expression vector pCPT- 
pyrQ and the sequence-verified vector was transformed into A. 
niger (Berka and Barnett 1989), The resultant strain was grown in 
shake flasks or a fcrmcntor. The culture supema tents were treated 
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overnight with O.I8 mg/mL of endoglucanase H at 37°C. After 
dcglycosylation, ammonium sulfate was added to a final concen- 
tration of 0.5 M, and the supernatant* were centrifuged. The T. 
recsei Cell2A proteins were then purified from these supematants 
by column chromatography. Approximately 1 mL of Butyl Seph- 
arose 4 Fast Flow resin (Amersham Biosciences) per 10 mg of 
CcJUA protein was loaded and equilibrated in a disposable drip 
column with 0,05 M Bis Tris Propane, 0.05 M ammonium acetate 
(pH 8). containing 0,5 M arnmonium sulfate- The supematants 
were loaded and the column washed with 3 vol of equilibration 
buffer, before elution with 3 vol of the same buffer without am- 
monium sulfate. Each column volume was collected as a separate 
fraction with the pure Cell 2 A proteins appearing in the second 
elution fraction. The identity of the purified proteins was con- 
firmed by tryptic mapping by HFLC Mass Spectrometry (data not 
shown). 

The genes encoding the T. koningii Cell 2 A, H. schweinitzii 
Cell2A> G. roseum Cell2C, and F.javanicum Cell 2 A endoglu- 
canases were cloned, expressed in A niger (Goedegcbuur et aL 
2002), and purified essentially as described for the T. reesei 
Cell2A, with the following changes in the chromatography; The 
EndoH-lreated cell-tree supernatant* of the first three were loaded 
and washed in the presence of 1 M ammonium sulfate; T. koningii 
Cell2A enzyme was purified by hydrophobic interaction chroma- 
tography (HIQ using the Butyl Sepharose 4 Fast Flow resim H. 
schweinitzii CcJ12A was purified On an Octyl Sepharose 4 Fast 
flow column* and G. roseum Ccll2C was purified On a Phenyl 
Sepharose 6 Fast Flow column; F. javanicum Cetl2A was purified 
by ion-exchange chromatography on Poros 20 HQ (Applied Bio- 
systems) in 0.02 M NaHiPOVNaOH (pH 6,8), using a 20-column 
volume gradient from 0 to 1 M NaCl. 

The S. sp. ? J AG ft Cell2A enzyme was expressed heterolo- 
gously in Streptomyces lividons (US patent no, 6287839). Ultra- 
filtered and concentrated fermentation broth was first treated with 
10% NajS0 4 (w/v) lo remove some contaminants by a slow, over- 
night precipitation at 37°C, followed by ceritrifugation. 5. sp. 
JIAC8 Cell2A was precipitated from the supernatant by adding 
solid (Nrij) 2 S0 4 to L.3 M, and centrifuging after holding the so- 
lution at 4°C for I h. The pellets were solubilized in -1/10 of tbe 
original volume of 0,1 M sodium acetate, 0-02% Tween 20 (pH 
5,5). The solubilized pellet was diluted 20-fold with 0.02 M 
NaH 2 PO«/NaOH, 1.0 M (NrUaSO* at pH 6.8 (fflC cquibbrarion 
buffer), and loaded onto an Octyl Sepharose 4 Fast Flow column. 
The column was washed with a 3 -column volume of equilibration 
buffer, and then eluted with a step of buffer containing 0-5 M 
(NHJ2 S °4 (P H Earlv elution fractions containing both the 
Cell2A cd, and some high molecular weight contaminants (not 
characterized) were not pooled. The most pure fractions (s97% 
pure) were pooled and then concentrated and diafiltered into 0.1 M 
ammonium acetate (pH 5.5). The final concentration to -15 mg/ 
mL was done using Millipore Ultrafree Centrifugal Filter Devices 
with Biomax 10,000 Dalton molecular weight cut-off membranes. 
The apparent mol ecular weight of the purified protein on an SDS 
polyacrylamide gel was 25-26 kD. This is the molecular weight 
expected for the S. sp. JlAG8CtM2Acd (US patent no. 6287839). 
The carboxy-tcrmmal cellulose-binding module (CBM) is re- 
moved by proteolysis of the linker between the cd and the CBM, 
resulting in two forms of the cd, 230 and 233 residues in length. 



Thermal denaturation experiments 

CD experiments were performed on an Aviv 62ADS spectropho- 
tometer (Protein Solutions), equipped with a five-position thermo- 
electric cell holder supplied by Aviv, Buffer conditions were 0.05 



M Bis-Tris propane, 0.05 M ammonium acetate adjusted topH 8.0 
with acetic acid. The final protein concentration for each experi- 
ment was in the range from 10 to 20 fCvl. Dam was collected in a 
0.1 -cm path length ceil. The thermal denaturation experiments 
were performed at 217 nrn, the wavelength in the far-UV spectra 
with maximum signal difference, as expected for a predominantly 
fi- sheet protein. The temperature was increased from 30 to 90°C 
with data collected every 2°, The equilibration time at each tem- 
perature was 0. 1 rnin and data were collected for 4 sec per sample. 
The thermal denaturation data were fitted to a two-state transition 
(Chen et al, 1992) using Savuka software provided by Dr. Osman 
Bilsel (University of Massachusetts Medical School, N. Worcester, 
MA). The mid-point of the transition <J„) is an apparent value, 
because the thermal denaturation of all the Ccll2A proteins stud- 
ied was not reversible. Because of this and a slow kinetic compo- 
nent due to aggregation, sufficiently fast scan rales were chosen, 
and experiments were carefully controlled for time spent in the 
thermal denaturation transition* so that small temporal variations 
did not lead to apparent T m differences. 

The pH of maximum thermal stabiuty for all Cell2A homoiogs 
studied was pH 5 (data not shown) as reported for the urea dena- 
turation of T. recsei Cell2A (Arunachalam and Kellis. Jr., 1996). 
A suboprlmal condition of pH 8.0 was used to facilitate detection 
of thermal stability differences. 

Specific enzyme activity 

To evaluate specific enzyme activity, and to monitor expression 
and purification of all the Cell 2 proteins, an oNPC (Sigma N 
4764) hydrolysis assay was used In a microtiter pJate, 100 U>L 50 
mM sodium acetate (pH 5.5) and 20 uX 25 mg/mL oNPC in assay 
buffer was added. Once equilibrated, 10 u.L of ccllulasc was added 
and (he plate incubated at 40*C for 10 rnin, unless otherwise speci- 
fied. To Stop the reaction, 70 p.L of 0.2 M glycine (pH 10-0) was 
added. The plate was then read in a microti tcr plate reader at 410 
nm_ As a reference, 10 \lL of a 0.1 rng/mL solution of 7. reesei 
Cell 2A enzyme provided an OD of around 0.3. The concentration 
of the T* reesei Cel 12 A enzyme was determined by the absorbance 
at 280 nm, using an extinction coefficient of 7871 1 M" 1 cm -1 
(3J52 g/L" 1 ), determined experimentally by the method of Edel- 
hoch as described in Pace et al. (1995). Extinction coefficients for 
the other Cell 2 homologs were calculated on tbe basis of their 
amino acid compositions (Pace et al- 1995). 

Protein crystallization 

The reesei Ccll2A A35V variant was crystallized under con; 
chtioiis similar to the wild-type enzyme (Sandgren et al. 2001) 
using 200 mM cacodylate buffer (pH 6.0), 200 mM calcium ac- 
etate and !0%-30% (w/w) mono-methyl-edier (mme) PEG 2000, 
at 2Q-24°C with hanging and sitting drops (McPherson 1982). 
Crystallization drops were prepared by mixing equal amounts of 
protein solution (20 mg/mL) and crystallization agent Large 
single, wedge-shaped crystals grew to a maximum size of I mm in 
alt directions within 1-2 d- The crystals belong 10 the space group 
P2, wiih cell dimensions a = 68.3 A, b = 7L3 A, c - 119.3 A, 
and £ = 91.5°. and have a calculated V m of 1.9 (Matthews 1968) 
with six molecules in the asymmetric unit. 

The ammonium aceiaxc cOMcenixatien in the $. sp- If AGS 
Cell2A protein sample was reduced to 0.01 M by dilution with 
water, and the enzyme ^concentrated to 15 mg/mL by ultrafiltra- 
tion. Crystallization experiments were set up using the vapor-dif- 
fusion method (McPherson 1982) with sitting drops at 25*C. A 
total of 5 |aL of precipitant was added to 5 jxL of protein solution 
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in a micro-bridge (Hampton Research), suspended over 0.5 raL of 
reagent in the well of a VDX crystallization plate, and sealed with 
a cover-slide. Crystals were obtained with the Hampton research 
crystal screen I (Jancarik and Kim 1991). After optimizing the 
conditions, large single crystals (1 x 1 x 0-5 mm) were obtained 
from 10%-2Q% (w/w) name PEG 5000, 200 mM sodium cacodyl- 
ate (pH 5.0-6.0). The crystals belong to space group flfifix with 
cell dimensions a = 65.1 K b « 54.5 A, c = 6X5 A, and have a 
calculated V m of 2.4 (Matthews 1968) with one molecule in the 
asymmetric unit. 

H> schweinitzii Cell 2 A crystals were obtained from a crystalli- 
zation agent containing 200 mM caeodylate buffer (pH 6.0), 200 
mM ammonium acetate, 109&-309& (w/w) nunc PEG 2000, using 
the vapor-diffusion method with hanging and sitting drops (McPh- 
crson 1982), at 20-24"C. Crystallization drops were prepared by 
mixing equal amounts of protein solution (15 tng/mL) and crys- 
tallization agent to a final drop-size of 6-S Initially, bunches 
of elongated crystals grew to a maximum size of 0.1 x 0.01 x 0.01 
mm within 1 to 2 d. To slow down the crystallization process, 1% 
isopropyl alcohol was added to the crystallization solution. Single, 
larger crystals (0.1 mm in all directions) were obtained after 6 mo 
of incubation. These crystals belong to space group P2 t with the 
following cell dimensions: a = 62J A, b = 77,5 A, c =» 83.4 A, 
and p = 98.5°, and have a calculated V m of 2.0 (Matthews 1968), 
with four molecules in the asymmetric unit. 



X*ray data collection 

All crystals were equilibrated in 25%-40% mme PEG 2000, 200 
mM sodium caeodylate (pH 5.0), mounted in a eryoloop, plunge 
frozen in liquid nitrogen (prior to transportation to the synchro- 
tron), or flash frozen in a nitrogen cryo- stream mounted on the 
detector. All data sets were collected from single crystals and at 
100 K. Data collection and processing statistics for me structures 
are given in Table 2a. 

The & sp. II ACS Cell2A data set was processed with the soft- 
ware distributed with the R-axis-JI image plate detector (Molecular 
Structure Corp). The H. sehweinUzu and T. reesei A35VCell2A 
data sets were processed and scaled with DENZO and 
SCALEPACK (OtwinowsM and Minor 1997). 

All subsequent data processing, after image integration, were 
performed using the CCP4 pacVage (Collaborative Computational 
Project Number 4 1994) unless otherwise stated. 



Structure solution and refinement 

The bacterial S. sp. 11 AGS Cell2A cd structure was solved by MR, 
using X-pior 3,1 (Branger 1992b), with the bacterial S. livkkms 
CelB catalytic core structure (FOB code 1NLR) as ihe search 
model. The resulting map clearly revealed the differences between 
the search model and the new structure. The initial structure model 
was built in Xfit, part of the Xtalview suite of programs (McRec 
1999) and refined in X-pJor 3.1 (BrUnger 1992b). The last two 
rounds of model building and refinement were performed with 
alternating cycles of O (Jones et al. 199 1) and Rcftnac 5.0 (Mur- 
shudov et al. 1997). 

The structure of H. schweinitzii Cfci 1 2 A was solved by MR with 
Amorc (Navaza 1994), using all of the homologous wild-type T. 
reesei Cell 2a structure (residues 1-21 S) as a search model. The 
MR search gave one clear solution with four molecules in the 
asymmetric unit. The initial map, calculated using the phases from 
the MR solution, wa$ improved by exploiting the fourfold NCS of 
the asyrrnrictric unit. Initial local symmetry operators were detcr- 
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mined in O (Jones et al. 1991), refined with RAVE (Kleywegt and 
Jones 1999). and the electron densities were cyclically averaged 
using the program DM (Cowtan and Main 1998), This produced an 
easily intcrpretablc electron density map, allowing the unambigu- 
ous building of a complete molecule, including all residues from I 
to 218. A single NAG residue could be clearly identified cova- 
Icntly bound to Asn 164. Strict NCS constraints between the four 
molecules in the asymmetric unit were only used in the first rounds 
of refinement No restraints were applied in the final rounds. 

The changes in unit cell parameters of the T. reesei Cell2A 
A35V mutant compared with the wild-type structure required the 
use of molecular replacement, using Amoro (Navaza 1994) to 
solve the structure. Structure model building and refinement of the 
A35V mutant and the H. schweinitzii Cell2A structures were per- 
formed with alternating cycles of model building using O (Jones et 
al. 199 1), and maximum likelihood model refinement, bulk solvent 
corrections, anisotropic scaling, and automated bulk solvent cor- 
rection using CNS version 1,0 (Briinger et al. 1998). Tn the final 
rounds of structure refinement, Rcfmac 5.0 was used (Mursbudov 
etal. 1997). 

All water molecules in the structure models were located auto- 
matically by using the water-picking protocols in the refinement 
programs. These water molecules were then selected manually or 
discarded by inspection. A set of reflections from the data sets 
were used to monitor the R-free (Briinger L992a). PROCHECK 
(Laskowski ct al. 1993) was used to evaluate the stereo-chemical 

Table 2k Structure refinement and final model statistics 



T. reesei S. sp. UAQR H. schweinitzii 



GH 12 Protein 


A35V 


wild type 


wild type 


PDB access code 


loa2 


loa3 


loa4 


Resolution used in 








refinement (A) 


20-1.5 


29-1.5 


29-1.70 


Reflections hi: 








working set 


167.263 


25.464 


83,042 


test set 


5174 


821 


2573 


R & factor (*) 


20.1; 21.9 


18.1: 19.3 


19.2: 21.7 


Protein molecules 








in AU 


6 


J. 


4 


Protein atoms 


9978 


1686 


6648 


Water molecules 


643 


417 


6648 


(6) (Ar) 


20.4 


16-3 


J 1.2 


Protein 03) (A 2 ) 


202 


15.6 


10.5 


water <B> (A 2 ) 


24.5 


25.8 


19.0 


RMSD bond lengths 








from ideal (A)* 


0.018 


0.0119 


o.on 


RMSD bond angles 








from ideal CT 


1.7 


1.5 


1.3 


RMSD AB on bonded 








atoms (A 2 ) 


0.9 


1.0 


1.2 


RMSD NCS O* (A) 


0-5 




0.2 


RMSD NCS all 








atoms (A) 


0.7 




0.5 


Stringent Ramachandian 








outliers (%)> 


0.8 


10 


0.9 



Values were caicuiatcu with O (Jones « aL !99l; Jones and Kjeldgaard 
1997). CNS 1.0 (Briinger cl al. 1998), MOLEMAN (Kleywegt and Jones 
1996b). LSQMAN (Kleywegt and Jones 1997). Rcftnac 5.0 (Murshudov el 
al. 1997). 

* From £ugh and Huber (1991). 

^According to the stringent boundary definition of Kleywcst and Jonca 
(1996a), 
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quality of the final models. Summaries of model refinement sta- 
tistics arc given in Table 2b. 

All structural comparisons were made with O (Jones et al. 
1991). and figures were prepared with O and rendered with Mol- 
Ray (Harris and Jones 2001 ). Coordinates and structure-factor am- 
plitudes have been deposited with the Protein Data Bank (Bern- 
stein et al. 1977), and have access codes IETs los*2, loa3, and loa4 
for the T. reesei A35V, H. sckweinilzii. and £ sj>. J1AG8 Cel 12A 
enzymes* respectively. The final electron density maps arc avail- 
able for viewing as part of the electron density server (EDS) ser- 
vice at http^/a^htray.bmcuu-se/eds, with ID's loaA2, loa3, and 
loa4. 
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